Corpus: rus_news_2009_30K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 96 99 99 99 99
1000 891 977 990 997 999
10000 6835 9135 9708 9879 9963
100000 16591 25776 28582 29417 29762
1000000 16591 25776 28582 29417 29762


Zipf's diagram for sentence endings


Gnuplot diagram

2085 msec needed at 2018-03-22 14:27